Tag
13 articles
Learn what a statutory demand is and how it’s being used to hold TikTok accountable for content moderation failures in Malaysia.
LinkedIn is cracking down on AI-generated 'slop' amid growing concerns over automated, low-quality content. The move highlights a contradiction with Microsoft's push for AI integration on the platform.
Elon Musk's X has committed to faster hate speech and terrorist content removal in the UK, while a separate probe into its AI assistant Grok remains open.
Apple nearly removed Elon Musk's AI app Grok from its App Store in January over its failure to curb nonconsensual sexual deepfakes, highlighting growing concerns about AI content moderation.
This article explains the AI safety mechanisms involved in detecting and responding to stalking threats, examining how advanced AI systems struggle with nuanced threat assessment and what that means for AI development and accountability.
Moonbounce raises $12 million to scale its AI control engine that translates content moderation policies into consistent AI behavior. The startup, founded by a former Facebook insider, aims to solve the challenge of maintaining predictable AI governance as AI systems take on more content moderation responsibilities.
Learn to build a real-time hate speech detection system using Python and transformer models, similar to Penemue's AI platform for identifying online hate and digital violence.
Learn to build an AI content analysis tool that can detect potentially problematic language in chatbot responses, prompted by the legal issues surrounding Grok and former Swiss president Karin Keller-Sutter.
This article explains how automated AI systems for content moderation can produce erroneous takedown notices, examining the technical architecture, trade-offs, and legal implications of such systems.
Meta's Oversight Board warns that Community Notes are ill-equipped to handle the growing threat of AI-generated disinformation, especially in vulnerable regions.
OpenAI has discontinued its erotic mode for ChatGPT, following a pattern of abandoning experimental features amid regulatory pressure and ethical concerns. The move reflects the company's strategic shift toward prioritizing safety and responsible development.
Meta has launched new AI content enforcement systems to improve platform safety and reduce reliance on third-party vendors. The company claims these tools will detect more violations with greater accuracy and respond more quickly to real-world events.